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Abstract 



^ . 

■ This paper studies prototypical strategies to sequentially aggregate independent decisions. We con- 
' sider a collection of agents, each performing binary hypothesis testing and each obtaining a decision 

cN : 

over time. We assume the agents are identical and receive independent information. Individual decisions 
. are sequentially aggregated via a threshold-based rule. In other words, a collective decision is taken as 

■ soon as a specified number of agents report a concordant decision (simultaneous discordant decisions 
y5 ' and no-decision outcomes are also handled). 

We obtain the following results. First, we characterize the probabilities of correct and wrong decisions 



. as a function of time, group size and decision threshold. The computational requirements of our approach 

■ are linear in the group size. Second, we consider the so-called fastest and majority rules, corresponding 

in ■ 
cn 

I accuracy and decision time. In the limit of large group sizes, we show that the decision time for the fastest 

, rule converges to the earliest possible individual time, and that the decision accuracy for the majority 

■ rule shows an exponential improvement over the individual accuracy. Additionally, via a theoretical and 
numerical analysis, we characterize various speed/accuracy tradeoffs. Finally, we relate our results to 
some recent observations reported in the cognitive information processing literature. 



I. Introduction 

A. Problem setup 

Interest in group decision making spans a wide variety of domains. Be it in electoral votes in politics, 
detection in robotic and sensor networks, or cognitive data processing in the human brain, establishing 
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the best strategy or understanding the motivation behind an observed strategy, has been of interest for 
many researchers. This work aims to understand how grouping individual sequential decision makers 
affects the speed and accuracy with which these individuals reach a collective decision. This class of 
problems has a rich history and some of its variations are studied in the context of distributed detection 
in sensor networks and Bayesian learning in social networks. 

In our problem, a group of individuals independently decide between two alternative hypothesis, and 
each individual sends its local decision to a fusion center. The fusion center decides for the whole 
group as soon as one hypothesis gets a number of votes that crosses a pre-determined threshold. We 
are interested in relating the accuracy and decision time of the whole population, to the accuracy and 
decision time of a single individual. We assume that all individuals are independent and identical. That 
is, we assume that they gather information corrupted by i.i.d. noise and that the same statistical test is 
used by each individual in the population. The setup of similar problems studied in the literature usually 
assumes that all individual decisions need to be available to the fusion center, before the latter can reach 
a final decision. The work presented here relaxes this assumption and the fusion center might provide 
the global decision much earlier than the all individuals in the group. Researchers in behavioral studies 
refer to decision making schemes where everyone is given an equal amount of time to respond as the 
"free response paradigm." Since the speed of the group's decision is one of our main concerns, we adjust 
the analysis in a way that makes it possible to compute the joint probabilities of each decision at each 
time instant. Such a paradigm is referred to as the "interrogation paradigm." 

B. Literature review 

The framework we analyze in this paper is related to the one considered in many papers in the literature, 
see for instance |[D,ll2l,ll3l,||4l,||5l,||6l,|l7l,|l8l,||9land references therein. The focus of these works is 
mainly two-fold. First, researchers in the fields aim to determine which type of information the decision 
makers should send to the fusion center. Second, many of the studies concentrate on computing optimal 
decision rules both for the individual decision makers and the fusion center where optimality refers to 
maximizing accuracy. One key implicit assumption made in numerous works, is that the aggregation rule 
is applied by the fusion center only after all the decision makers have provided their local decisions. 

Tsitsiklis in IT] studied the Bayesian decision problem with a fusion center and showed that for large 
groups identical local decision rules are asymptotically optimal. Varshney in lH proved that when the 
fusion rules at the individuals level are non-identical, threshold rules are the optimal rules at the individual 
level. Additionally, Varshney proved that setting optimal thresholds for a class of fusion rules, where a 
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decision is made as soon as a certain number q out of the group members decide, requires solving 
a number of equations that grows exponentially with the group size. The fusion rules that we study in 
this work fall under the q out of N class of decision rules. Finally, Varshney proved that this class of 
decision rules is optimal for identical local decisions. 

C. Contributions 

The contributions of this paper are three-folds. First, we introduce a recursive approach to characterize 
the probabilities of correct and wrong decisions for a group of sequential decision makers (SDMs). These 
probabilities are computed as a function of time, group size and decision threshold. The key idea is to 
relate the decision probability for a group of size N at each time t, to the decision probability of an 
individual SDM up to that time t, in a recursive manner. Our proposed method has many advantages. First, 
our method has a numerical complexity that grows only linearly with the number of decision makers. 
Second, our method is independent of the specific decision making test adopted by the SDMs and requires 
knowledge of only the decision probabilities of the SDMs as a function of time. Third, our method 
allows for asynchronous decision times among SDMs. To the best of our knowledge, the performance 
of sequential aggregation schemes for asynchronous decisions has not been previously studied. 

Second, we consider the so-csLWed fastest and majority rules corresponding, respectively, to the decision 
thresholds q = 1 and q = \N/2\ . For these rules we provide a comprehensive scalability analysis of both 
accuracy and decision time. Specifically, in the limit of large group sizes, we provide exact expressions 
for the expected decision time and the probability of wrong decision for both rules, as a function of the 
decision probabilities of each SDM. For t\\e. fastest rule we show that the group decision time converges 
to the earliest possible decision time of an individual SDM, i.e., the earliest time for which the individual 
SDM has a non-zero decision probability. Additionally, the. fastest rule asymptotically obtains the correct 
answer almost surely, provided the individual SDM is more likely to make the correct decision, rather 
than the wrong decision, at the earliest possible decision time. For the majority rule we show that the 
probability of wrong decision converges exponentially to zero if the individual SDM has a sufficiently 
small probability of wrong decision. Additionally, the decision time for the majority rule is related to the 
earliest time at which the individual SDM is more likely to give a decision than to not give a decision. This 
scalability analysis relies upon novel asymptotic and monotonicity results of certain binomial expansions. 

As third main contribution, using our recursive method, we present a comprehensive numerical analysis 
of sequential decision aggregation based on the q out of N rules. As model for the individual SDMs, we 
adopt the sequential probability ratio test (SPRT), which we characterize as an absorbing Markov chain. 
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First, for the fastest and majority rules, we report how accuracy and decision time vary as a function 
of the group size and of the SPRT decision probabiUties. Second, in the most general setup, we report 
how accuracy and decision time vary monotonically as a function of group size and decision threshold. 
Additionally, we compare the performance of fastest versus majority rules, at fixed group accuracy. We 
show that the best choice between the fastest rule and the majority rule is a function of group size and 
group accuracy. Our numerical results illustrate why the design of optimal aggregation rules is a complex 
task ifTOl . Finally, we discuss relationships between our analysis of sequential decision aggregation and 
mental behavior documented in the cognitive psychology and neuroscience literature |[TT|| . |[T2l . |[T3l . 

m. 

Finally, we draw some qualitative lessons about sequential decision aggregation from our mathematical 
analysis. Surprisingly, our results show that the accuracy of a group is not necessarily improved over the 
accuracy of an individual. In aggregation based on the majority rule, it is true that group accuracy is 
(exponentially) better than individual accuracy; decision time, however, converges to a constant value for 
large group sizes. Instead, if a quick decision time is desired, then the fastest rule leads, for large group 
sizes, to decisions being made at the earliest possible time. However, the accuracy of fastest aggregation is 
not determined by the individual accuracy (i.e., the time integral of the probability of correct decision over 
time), but is rather determined by the individual accuracy at a specific time instant, i.e., the probability 
of correct decision at the earliest decision time. Accuracy at this special time might be arbitrarily bad 
especially for "asymmetric" decision makers (e.g., SPRT with asymmetric thresholds). Arguably, these 
detailed results for fastest and majority rules, q = 1 and q = \_N/2\ respectively, are indicative of the 
accuracy and decision time performance of aggregation rules for small and large thresholds, respectively. 

D. Decision making in cognitive psychology 

An additional motivation to study sequential decision aggregation is our interest in sensory information 
processing systems in the brain. There is a growing belief among neuroscientists |[T2l . |[T3l . |[T4l that 
the brain normally engages in an ongoing synthesis of streams of information (stimuli) from multiple 
sensory modalities. Example modalities include vision, auditory, gustatory, olfactory and somatosensory. 
While many areas of the brain (e.g., the primary projection pathways) process information from a single 
sensory modality, many nuclei (e.g., in the Superior CoUiculus) are known to receive and integrate 
stimuli from multiple sensory modalities. Even in these multi-modal sites, a specific stimulus might 
be dominant. Multi-modal integration is indeed relevant when the response elicited by stimuli from 
different sensory modalities is statistically different from the response elicited by the most effective of 
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those stimuli presented individually. (Here, the response is quantified in the number of impulses from 
neurons.) Moreover, regarding data processing in these multi-modal sites, the procedure with which 
stimuli are processed changes depending upon the intensity of each modality-specific stimulus. 

In |[T2l . Werner et al. study a human decision making problem with multiple sensory modalities. They 
present examples where accuracy and decision time depend upon the strength of the audio and visual 
components in audio-visual stimuli. They find that, for intact stimuli (i.e., noiseless signals), the decision 
time improves in multi-modal integration (that is, when both stimuli are simultaneously presented) as 
compared with uni-sensory integration. Instead, when both stimuli are degraded with noise, multi-modal 
integration leads to an improvement in both accuracy and decision time. Interestingly, they also identify 
circumstances for which multi-modal integration leads to performance degradation: performance with an 
intact stimulus together with a degraded stimulus is sometimes worse than performance with only the 
intact stimulus. 

Another point of debate among cognitive neuroscientists is how to characterize uni-sensory versus 
multi-modal integration sites. Neuro-physiological studies have traditionally classified as multi-modal 
sites where stimuli are enhanced, that is, the response to combined stimuli is larger than the sum of the 
responses to individual stimuli. Recent observations of suppressive responses in multi-modal sites has 
put this theory to doubt; see |[T3l . |[T4l and references therein. More specifically, studies have shown that 
by manipulating the presence and informativeness of stimuli, one can affect the performance (accuracy 
and decision time) of the subjects in interesting, yet not well understood ways. We envision that a more 
thorough theoretical understanding of sequential decision aggregation will help bridge the gap between 
these seemingly contradicting characterization of multi-modal integration sites. 

As a final remark about uni-sensory integration sites, it is well known |[T5l that the cortex in the 
brain integrates information in neural groups by implementing a drift-diffusion model. This model is the 
continuous-time version of the so-called sequential probabiUty ratio test (SPRT) for binary hypothesis 
testing. We will adopt the SPRT model for our numerical results. 

E. Organization 

We start in Section|lI]by introducing the problem setup. In Section |lll] we present the numerical method 
that allows us to analyze the decentralized Sequential Decision Aggregation (SDA) problem; We analyze 
the two proposed rules in Section JV] We also present the numerical results in Section |Vl Our conclusions 
are stated in Section |Vll The appendices contain some results on binomial expansions and on the SPRT. 
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II. Models of sequential aggregation and problem statement 

In this section we introduce the model of sequential aggregation and the analysis problem we want 
to address. Specifically in Subsection III-AI we review the classical sequential binary hypothesis testing 
problem and the notion of sequential decision maker, in Subsection III-BI we define the q out of N 
sequential decisions aggregation setting and, finally, in Subsection III-C[ we state the problem we aim to 
solve. 

A. Sequential decision maker 

The classical binary sequential decision problem is posed as follows. 

Let H denote a hypothesis which takes on values Hq and Hi. Assume we are given an individual 
(called sequential decision maker (SDM) hereafter) who repeatedly observes at time t = 1,2,..., a 
random variable X taking values in some set X with the purpose of deciding between Hq and Hi. 
Specifically the SDM takes the observations x{l),x{2),x{3), . . ., until it provides its final decision at 
time r, which is assumed to be a stopping time for the sigma field sequence generated by the observations, 
and makes a final decision 6 based on the observations up to time r. The stopping rule together with 
the final decision rule represent the decision policy of the SDM. The standing assumption is that the 
conditional joint distributions of the individual observations under each hypothesis are known to the 
SDM. 

In our treatment, we do not specify the type of decision policy adopted by the SDM. A natural way 
to keep our presentation as general as possible, is to refer to a probabilistic framework that conveniently 
describes the sequential decision process generated by any decision policy. Specifically, given the decision 
policy 7, let x'q^ and Xi'^ be two random variables defined on the sample space N x {0, 1} U {?} such 
that, for i,j G {0, 1}, 

• {x^j'^ = represents the event that the individual decides in favor of Hi at time t given that 
the true hypothesis is Hj; and 

• {Xj^^ =?} represents the event that the individual never reaches a decision given that Hj is the 
correct hypothesis. 

Accordingly, define p^jjj{t) and p^^j*^. to be the probabilities that, respectively, the events {x^j'^ = 
and {xo^^ =?} occur, i.e, 

it) = P[xi^^ = (t, i)] and p^^. = PfxJ-") =?] . 
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Then the sequential decision process induced by the decision policy 7 is completely characterized by 
the following two sets of probabilities 

{P%} U W W^^'S W},,, and {p%} U {pg W,pSlJ(t)}^^^ , (1) 

where, clearly p^^^ + E*=i (p^li) +P^^iiit)) = 1 and p^])^ + Et=i + pIJ^ (O) = 1- m what 

follows, while referring to a SDM running a sequential distributed hypothesis test with a pre-assigned 
decision policy, we will assume that the above two probabilities sets are known. From now on, for 
simplicity, we will drop the superscript (7). 

Together with the probability of no-decision, for j G {0, 1} we introduce also the probability of correct 
decision p^^j := P[say Hj \ Hj] and the probability of wrong decision p^^^j := P[say Hi, i ^ j \ Hj], that 
is, 

00 00 

P<:\i = ^Pj\j(^) and p^\j = ^Pi\j{t), i / j. 
t=i t=i 

It is worth remarking that in most of the binary sequential decision making literature, p^^i and p^^Q are 
referred as, respectively, the mis-detection and false-alarm probabilities of error. 

Below, we provide a formal definition of two properties that the SDM might or might not satisfy. 

Definitioii II.l For a SDM with decision probabilities as in ([T]), the following properties may be defined: 

(i) the SDM has almost-sure decisions if for j G {0, 1}, 

00 

{Po\jii) = 1> and 

t=i 

(ii) the SDM has finite expected decision time if for j G {0,1}, 

00 

Yt{po\j{t) +Pl\jit)) < 00. 

t=i 

One can show that the finite expected decision time implies almost-sure decisions. 

We conclude this section by briefly discussing examples of sequential decision makers. The classic 
model is the SPRT model, which we discuss in some detail in the example below and in Section |Vl 
Our analysis, however, allows for arbitrary sequential binary hypothesis tests, such as the SPRT with 
time-varying thresholds |[T6l . constant false alarm rate tests fTTTl . and generalized likelihood ratio tests. 
Response profiles arise also in neurophysiology, e.g., ifTSl presents neuron models with a response that 
varies from unimodal to bimodal depending on the strength of the received stimulus. 

Example II.2 (Sequential probability ratio test (SPRT)) In the case the observations taken are inde- 
pendent, conditioned on each hypothesis, a well-known solution to the above binary decision problem is 
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the so-called sequential probability ratio test (SPRT) that we review in Section |Vl A SDM implementing 
the SPRT test has both the almost-sure decisions and finite expected decision time properties. Moreover 
the SPRT test satisfies the following optimality property: among all the sequential tests having pre- 
assigned values of mis-detection and false-alarm probabilities of error, the SPRT is the test that requires 
the smallest expected number of iterations for providing a solution. 

In Appendices IB 1 1 and IB 2 1 we review the methods proposed for computing the probabilities 
when the SPRT test is applied, both in the case X is a discrete random variable and in the case X is a 
continuous random variable. For illustration purposes, we provide in Figure[T]the probabilities Pi\j{t) when 
j = I for the case when X is a continuous random variable with a continuous distribution (Gaussian). 
We also note that Pi\j{t) might have various interesting distributions. 
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Fig. 1. This figure illustrates a typical unimodal set of decision probabilities {pi\i{t)}t€m and {poii (t)}teN. Here the SDM 
is implementing the sequential probability ratio test with three different accuracy levels (see Section |V] for more details). 



B. The q out of N decentralized hypothesis testing 

The basic framework for the binary hypothesis testing problem we analyze in this paper is the one in 
which there are N SDMs and one fusion center. The binary hypothesis is denoted by H and it is assumed 
to take on values Hq and Hi. Each SDM is assumed to perform individually a binary sequential test; 
specifically, for i G {1, . . . , N}, at time t G N, SDM i takes the observation Xi{t) on a random variable 
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Xi, defined on some set Xi, and it keeps observing Xi until it provides its decision according to some 
decision policy 7i. We assume that 

(i) the random variables {Xi}^^ are identical and independent; 

(ii) the SDMs adopt the same decision policy 7, that is, 7^ = 7 for alH G {1, . . . , N}; 

(iii) the observations taken, conditioned on either hypothesis, are independent from one SDM to another; 

(iv) the conditional joint distributions of the individual observations under each hypothesis are known 
to the SDMS. 

In particular assumptions (i) and (ii) imply that the decision processes induced by the N SDMs are 
all described by the same two sets of probabilities 

{Pnd|o} U {P0|0(0>Pl|0(*)}tgK {Pnd\l} U {Po|l (*) > Pl| 1 (*) j^g^ " 

We refer to the above property as homogeneity among the SDMs. 

Once a SDM arrives to a final local decision, it communicates it to the fusion center. The fusion center 
collects the messages it receives keeping track of the number of decisions in favor of Hq and in favor of 
Hi. A global decision is provided according to a g out of N counting rule: roughly speaking, as soon 
as the hypothesis Hi receives q local decisions in its favor, the fusion center globally decides in favor 
of Hi. In what follows we refer to the above framework as q out of N sequential decision aggregation 
with homogeneous SDMs (denoted as q out of N SDA, for simplicity). 

We describe our setup in more formal terms. Let N denote the size of the group of SDMs and let q 
be a positive integer such that I < q < N, then the q out of N SDA with homogeneous SDMs is defined 
as follows: 

SDMs iteration : For each i G {!,..., N}, the i-th SDM keeps observing Xi, taking the observations 
Xi{l),Xi{2), . . . , until time Tj where it provides its local decision di G {0, 1}; specifically = if 
it decides in favor of Hq and = 1 if it decides in favor of Hi. The decision di is instantaneously 
communicated (i.e., at time r^) to the fusion center. 

Fusion center state : The fusion center stores in memory the variables Counto and Counti, which are 
initialized to 0, i.e., Con?ito(0) = Counti{Q) = 0. If at time t G N the fusion center has not yet 
provided a global decision, then it performs two actions in the following order: 
(1) it updates the variables CountQ and Counti, according to CountQ{t) = CountQ{t — 1) +no(t) 
and Counti{t) = Counti{t— 1) + ni{t) where no(t) and ni{t) denote, respectively, the number of 
decisions equal to and 1 received by the fusion center at time t. 
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(2) it checks if one of the following two situations is verified 

{Counti{t) > Counto{t), I Counti{t) < Count(){t). 

{ii) i (3) 
Counti{t) > q, I Counto{t) > q. 

If (i) is verified the fusion center globally decides in favor Hi, while if (ii) is verified the fusion 

center globally decides in favor of Hq. Once the fusion center has provided a global decision the q 

out of N SDA algorithm stops. 

Remark II.3 (Notes about SDA) (i) Each SDM has in general a non-zero probability of not giving 
a decision. In this case, the SDM might keep sampling infinitely without providing any decision 
to the fusion center. 

(ii) The fusion center does not need to wait until all the SDM have provided a decision before a 
decision is reach on the group level, as one of the two conditions (i) or {ii) in equation [3] might 
be satisfied much before the SDM provide their decisions. 

(iii) While we study in this manuscript the case when a fusion center receives the information from all 
SDM, we note that a distributed implementation of the SDA algorithm is possible. Analysis similar 
to the one presented here is possible in that case. □ 

C. Problem formulation 

We introduce now some definitions that will be useful throughout this paper. Given a group of N 
SDMs running the q out of N SDA algorithm, 1 < < A^, we denote 

(i) by T the random variable accounting for the number of iterations required to provide a decision 

T =min{t | either case (i) or case (ii) in equation ([3]) is satisfied}; 

(ii) by Piy{t; N,q) the probability of deciding, at time t, in favor of Hi given that Hj is correct, i.e., 

Pi\j{t; N, q) := P [Group of N SDMs says Hi \ Hj, q,T = t]; (4) 

(iii) by Pc\j{N, q) and p^\j{N, q) the probabiUty of correct decision and of wrong decision, respectively, 
given that Hj is the correct hypothesis, i.e., 

oo oo 

Pc\j{N, q) = ^Vj\j{t] N, q) and p^\jiN, q) = ^Pi\j{t\ N, q), i ^ j; (5) 
t=i t=i 

(iv) by Pnd|j(-^) <?)' j £ {0)1}' the probability of no-decision given that Hj is the correct hypothesis, 

i.e., 

oo 

Pnd\j{N,q) := 1 - {Po\j{t;N,q)+pi\j{t;N,q)) = l-p^\j{N,q) -p,\j{N,q); (6) 
t=i 
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(v) by E \T\Hj, N, q] the average number of iterations required by the algorithm to provide a decision, 
given that Hj is the correct hypothesis, i.e., 

E[T\Hj,N,q]:= I (7) 
[+00, ifp„i\j{N,q)>0. 

Observe that Pi\j{t; 1, 1) coincides with the probability Pi\j{t) introduced in ([T||. For ease of notation we 
will continue using Pi\j{t) instead of Pi\j{t; 1, 1). 

We are now ready to formulate the problem we aim to solve in this paper. 

Problem II.4 (Sequential decision aggregation) Consider a group of N homogeneous SDMs with de- 
cision probabilities {pnd|o} U {po|o(*)>Pi|o(*)}tgN ^""^ {^'nd|i} U {PG\i{t),Pi\i{t)] Assume the N 
SDMs run the q out of N SDA algorithm with the purpose of deciding between the hypothesis Hq and 
Hi. For j G {0,1}, compute the distributions {p^j{t;N,q)^^^^ as well as the probabilities of correct 
and wrong decision, i.e., Pc\jiN,q) and p^^j{N,q), the probability of no-decision Pndlji^jQ) '^'^'^ th£ 
average number of iterations required to provide a decision, i.e., E [T|ifj , q\. 

We will focus on the above problem in the next two Sections, both through theoretical and numerical 
results. Moreover, in Section ITVl we will concentrate on two particular values of q, specifically for g = 1 
and q = \_N/2\ + 1, characterizing the tradeoff between the expected decision time, the probabilities of 
correct and wrong decision and the size of the group of SDMs. When q = I and q = , we will 

refer to the q out of N rule as the fastest rule and the majority rule, respectively. In this case we will 
use the following notations 

P%{N) := p,y{N; q = 1), ^^^.(iV) := p^y{N; q = 1) 

and 

pJ^(iV) ■■=Pc\j{N;q= [N/2\ + 1), p5;)(iV) :=p,|^.(Ar;g = lN/2\ + 1). 

We end this Section by stating two propositions characterizing the almost-surely decisions and finite 
expected decision time properties for the group of SDMs. 

Proposition II.5 Consider a group of N SDMs running the q out of N SDA algorithm. Let the decision- 
probabilities of each SDM be as in For j £ {0, 1}, assume there exists at least one time instant 
tj G N such that both probabilities Po\j{tj) and Pi\j{tj) are different from zero. Then the group of SDMs 
has the almost-sure decision property if and only if 
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(i) the single SDM has the almost-sure decision property; 

(ii) N is odd; and 

(iii) q is such that 1 < q< \N/2\. 

Proof: First we prove that if the group of SDMs has the almost-sure decision property, then properties 
(i), (ii) and (iii) are satisfied. To do so, we show that if one between the properties (i), (ii) and (iii) fails 
then there exists an event of probability non-zero that leads the group to not provide a decision. First 
assume that the single SDM does not have the almost-sure decision property, i.e., > 0, j G {0, 1}. 
Clearly this implies that the event "all the SDMs of the group do not provide a decision " has probability 
of occurring equal to which is strictly greater than zero. Second assume that N is even and consider 
the event "at time tj, N/2 SDMs decide in favor of Hq and N/2 SDMs decide in favor of Hi". Simple 
combinatoric and probabilistic arguments show that the probability of this event is {j^^2) Po/j'^Pi/j'^^ which 
is strictly greater than zero because of the assumption Poijiij) / and Pi\j{tj) ^ 0. Third assume that 
q > [N/2\ + 1. In this case we consider the event "at time tj, SDMs decide in favor of Hq 

and [N/2\ SDMs decide in favor of Hi" that, clearly, leads the group of SDMs to not provide a global 
decision for any q > [N/2\ + 1. Similarly to the previous case, we have that the probability of this event 



We prove now that if properties (i), (ii) and (iii) are satisfied then the group of SDMs has the almost- 
sure decision property. Observe that, since each SDM has the almost-sure decision property, there exists 
almost surely a A^-tuple {ti, . . . ,tj\f) G such that the i-th SDM provides its decision at time tj. 
Let t := max{tj | i G {!,..., A^}}. Since N is odd, then Counti{t) ^ CountQ{t). Moreover since 
Q < L^/2j + 1 and Counti{t) + CountQit) = N, either Counti{t) > q or CountQ{t) > q holds true. 
Hence the fusion center will provide a global decision not later than time t. ■ 

Proposition II.6 Consider a group of N SDMs running the q out of N SDA algorithm. Let the decision- 
probabilities of each SDM be as in (O. For j G {0, 1}, assume there exists at least one time instant 
tj G N such that both probabilities piy{tj) and Pi\j{tj) are different from zero. Then the group of SDMs 
has the finite expected decision time property if and only if 

(i) the single SDM has the finite expected decision time property; 

(ii) N is odd; and 

(iii) q is such that 1 <q< \N/2~\. 

Proof: The proof follows the lines of the proof of the previous proposition. ■ 
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Remark II.7 The existence, for j G {0,1}, of a time tj such that Po\j{tj) 7^ and Pi\j{tj) 7^ 0, is 
necessary only for proving the "if" side of the previous propositions. In other words the validity of 
properties (i), (ii) and (iii) in Proposition III.5I (resp. in Prop. III. 61 ) guarantees that the group of SDMs 
possesses the almost-sure decision property (resp. the finite expected decision time property.) □ 

III. Recursive analysis of the q-o\]j-0¥-N sequential aggregation rule 

The goal of this section is to provide an efficient method to compute the probabilities Piy{t; N,q), 
i,j G {0, 1}. These probabilities, using equations ([S]), Q and ^ will allow us to estimate the probabilities 
of correct decision, wrong decision and no-decision, as well as the expected number of iterations required 
to provide the final decision. 

We first consider in subsection IIII-AI the case where I < q < [N/2\; in subsection IIII-B I we consider 
the case where [N/2\ +1 <q< N. 

A. Case l<q< [N/2\ 

To present our analysis method, we begin with an informal description of the decision events charac- 
terizing the q out of N SDA algorithm. Assume that the fusion center provides its decision at time t. 
This fact implies that neither case (i) nor case (ii) in equation ^ has happened at any time before t. 
Moreover, two distinct set of events may precede time t, depending upon whether the values of the 
counters County and Counti at time t — 1 are smaller than q or not. In a first possible set of events, say 
the "simple situation," the counters satisfy < CountQ{t — l),Counti{t — I) < q — I and, hence, the 
time t is the first time that at least one of the two counters crosses the threshold q. In a second possible 
set of events, say the "canceling situation," the counters CountQ{t — 1) and Counti{t — 1) are greater 
than q and, therefore, equal. In the canceling situation, there must exist a time instant f < t — 1 such that 
Counto{f — 1) < q, Counti{f — I) < q and Counto{T) = Counti{T) > g for all r G {f + 1, . . . , t — 1}. 
In other words, both counters cross the threshold q at the same time instant f reaching the same value, 
that is, CountQ^f) = Counti{f), and, for time r G {r + 1, . . . , t — 1}, the number no(r) of SDMs 
deciding in favor of Hq at time r and the number ?ii(r) of SDMs deciding in favor of Hi at time r 
cancel each other out, that is, no(T) = ni(T). 

In what follows we study the probability of the simple and canceling situations. To keep track of 
both possible set of events, we introduce four probability functions, a, /3, a, /3. The functions a and /3 
characterize the simple situation, while a and ^ characterize the canceling situation. First, for the simple 
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situation, define tlie probability function a : N x {0, . . . , g — 1} x {0, . . . , g' — 1} — )■ [0, 1] as follows: 
given a group of sq + si SDMs, a{t, so, si) is the probability that 

(i) all the sq + si SDMs have provided a decision up to time t; and 

(ii) considering the variables Counto and Counti restricted to this group of so+si SDMs , Counto{t) = 
Sq and Countiit) = si. 

Also, define the probabiUty function : N x {0, . . . , g — 1} x {0, . . . , g - 1} — [0, 1], j G {0, 1} as 
follows: given a group of — (sq + si) SDMs, sq, si) is the probability that 

(i) no SDMs have provided a decision up to time t — 1; and 

(ii) considering the variables County and Counti restricted to this group of — (sq + si) SDMs, 
CountQ[t) + So < Counti[t) + si, and Counti{t) + si > q. 

Similarly, it is straightforward to define the probabilities /3q|j, j G {0, 1}. 

Second, for the canceling situation, define the probability function a : N x {g, . . . , [A^/2j} — )• [0, 1] as 
follows: given a group of 2s SDMs, a{t,s) is the probability that 

(i) all the 2s SDMs have provided a decision up to time t; and 

(ii) there exists f <t such that, considering the variables Counto and Counti restricted to this group 
of 2s SDMs 

• Counto{f — I) < q and Counti{f — 1) < q; 

• Countoir) = Counti^r) > q for all r > r. 

Also, define the probabiUty function : N x {g', . . . [A^/2j} — > [0, 1], j G {0, 1} as follows: given a 
group of A^ - 2s SDMs, s) is the probability that 

(i) no SDMs have provided a decision up to time t — 1; and 

(ii) at time t the number of SDMs providing a decision in favor of Hi is strictly greater of the number 
of SDMs providing a decision in favor of Ho- 

Similarly, it is straightforward to define the probabilities (Sq^, j G {0, 1}. 

Note that, for simplicity, we do not explicitly keep track of the dependence of the probabilities /3 
and P upon the numbers A^ and q. The following proposition shows how to compute the probabilities 
{pi\j{t; N,q)^'^_^, i,j G {0, 1}, starting from the above definitions. 

Proposition III.l (q out of A^: a recursive formula) Consider a group of N SDMs, running the q 
out of A^ SDA algorithm. Without loss of generality, assume Hi is the correct hypothesis. Then, for 
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i £ {0, 1}, we have, for t = 1, 



p,|i(l;A^,(?) = A|i(l,0,0), 



(8) 



and, for t > 2, 



Pi\i{t;N,q) =Y,Y. \ . a(t-l,So,si)A|i(i,So,si)+ J]] ( a(t-l,s)A|i(t,s). (9) 





Proof: The proof that formulas in ([8]) hold true follows trivially form the definition of the quantities 
(1,0,0) and /3o|i(l, 0, 0). We start by providing three useful definitions. 
First, let Et denote the event that the SDA with the q out of N rule provides its decision at time t in 
favor of Hi. 

Second, for sq and si such that < sq, si < q — 1, let Esg^s^^t denote the event such that 

(i) there are sq SDMs that have decided in favor of Hq up to time t — 1; 

(ii) there are si SDMs that have decided in favor of Hi up to time t — 1; 

(iii) there exist two positive integer number tq and ri such that 

• So + ro < si + ri and si +ri > q. 

• at time t, vq SDMs decides in favor of Hq while ri SDMs decides in favor of Hi 
Third, for q < s < [N/2\, let Eg^t denote the event such that 

(i) 2s SDMs have provided their decision up to time t — 1 balancing their decision, i.e., there exists 
f < t — I with the properties that, considering the variables Count- and Count-^. restricted to 
these 2s SDMs 

• CountQ{T) < q, Counti{T) < q, for 1 < t < r — 1; 

• CountQ[T) = Counti^r) for f < t < t — I; 

• Counto{t — 1) = Counti{t — 1) = s. 

(ii) at time t the number of SDMs providing their decision in favor of Hi is strictly greater than the 
number of SDMs deciding in favor of Hq. 

Observe that 




Since E. 



,^^ < So, Si < q — I, and Eg^t, <? < -s < [N/2\ are disjoint sets, we can write 






(10) 



0<so,si<q—l 



q<s<lN/2\ 
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Observe that, according to the definitions of a{t — l,so,si), a{t — l,s), (3i\i{t, sq, si) and (3i\i{t,s), 
provided above, 

^[Eso,sJ=( ^ )a{t-l,so,si)PMt,so,si) (11) 



Sl + So 



and that 



^[Es,t]= (^^)a(t-l,s)^i|i(t,s). (12) 

Plugging equations (fTTI) and (fT2l) into equation (ITOl ) concludes the proof of the Theorem. ■ 
Formulas, similar to the ones in ^ and ^ can be provided for computing also the probabilities 

{Pi\o{t;N,q)}^^^,i€{0,l}. 

As far as the probabilities a{t, sq, si), a{t, s), fii\j{t, sq, si), s), i,j G {0, 1}, are concerned, we 

now provide expressions to calculate them. 

Proposition III.2 Consider a group of N SDMs, running the q out of N SDA algorithm for \ < q < 
\_N/2\. Without loss of generality, assume Hi is the correct hypothesis. For i G {0, 1}, let 7rj|x : N — )• [0, 1] 
denote the cumulative probability up to time t that a single SDM provides the decision Hi, given that 
Hi is the correct hypothesis, i.e., 

t 

7T,li{t) = Y,Pi\l{t). (13) 

s=l 

For t G N, So, Sl G {1, . . . , g— 1}, s G {g, . . . , [N/2\ }, the probabilities a(t, sq, si), a{t, s), /3i|i(i, so, si), 
and /3i|i(t, s) satisfy the following relationships (explicit for a, j3, f3 and recursive for a): 

a(t,so,si)= ('° + '^)7r^j',(t)7rJf,(t), 

"(M) = i; E (^^';^J('^;^°-^^)a(t-i,so,sOp^r-(tKr/^ 

So — Sl — 

/3iii(Mo,si)= y: r~%'\,ii)\f: ('^"'"^^)po'iiW(i--iiiw--oiiw)''-^"-'"-'^ 

hi=i ^ ^ ^ -n V / 



L?io=0 



where s = so + si, m = min{/ii + si — sq — 1, — (so + si) — /ii} anJ rh = min{/ii — 1, — 2s — /ii}. 
Moreover, corresponding relationships for /3o|i(t, so, si) anJ ;So|i(t,s) are obtained by exchanging the 
roles o/pi|i(t) w/f/i Po|i(^) ''^ ^'^^ relationships for /3i|i(t, sq, si) anJ s). 
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Proof: The evaluation of a(t, sq, si) follows from standard probabilistic arguments. Indeed, observe 
that, given a first group of so SDMs and a second group of si SDMs, the probability that all the SDMs 
of the first group have decided in favor of Hq up to time t and all the SDMs of the second group have 
decided in favor of Hi up to time t is given by 'K^^^^{t)T:^^{t) . The desired result follows from the fact 
that there are (^^j^^") ways of dividing a group of sq + si SDMs into two subgroups of sq and si SDMs. 

Consider now a;(i,s). Let Ea[t,s) denote the event of which a{t,s) is the probability of occurring, 
that is, the event that, given a group of 2s SDMs, 

(i) all the 2s SDMs have provided a decision up to time t; and 

(ii) there exists f < t such that, considering the variables County and Counti restricted to this group 
of 2s SDMs 

• Counto{f — 1) < q and Counti{f — 1) < q; 

• CountQ[T) = Counti{T) > q for all r > r. 

Now, for a group of 2s SDMs, for < sq, si < g — 1, let £^t-i,so,si denote the event that 

(i) So (resp. si) SDMs have decided in favor of Hq (resp. Hi) up to time t — 1; 

(ii) s — So (resp. s — si) SDMs decide in favor of Hq (resp. Hi) at time t. 

Observing that for so + si assigned SDMs the probability that fact (i) is verified is given by a{t — l, sq, si) 
we can write that 

nEt-i,s„,sA = (^^'^'^J '^)a(t - i,so,siKr-(tKr,-(t). 

Consider again a group of 2s SDMs and foiq<h<s let Ef^i^h denote the event that 

(i) 2h SDMs have provided a decision up to time t — 1; 

(ii) there exists f < t — 1 such that, considering the variables Counto and Counti restricted to the 
group of 2h SDMs that have already provided a decision, 

• CountQ{f — 1) < q and Counti{f — 1) < q; 

• Counto{T) = Counti{T) > q for all t > f; and 

• Counto{t — 1) = Counti{t — 1) = h; 

(iii) at time instant t, s — h SDMs decide in favor of Hq and s — h SDMs decide in favor of Hi. 
Observing that for 2h assigned SDMs the probability that fact (i) and fact (ii) are verified is given by 
a{t — l,h), we can write that 

^t^*-^''^] = (2O Cs - fy^' - wri'(*)- 



August 30, 2010 



DRAFT 



18 



Observe that 

/ g 9 \ /LJV/2J _ \ 

^^(M) = U U U U ^^-i.'^ ■ 

\so=Osi=0 / \ h=q J 

Since the events Et-i^so,si> < so,si < <? and Et^i^h, q < h < [N/2\, are all disjoint we have that 

(J-l <J-1 s 

So=Osi=0 h=q 

Plugging the expressions of F[Et-i^so,si] and F[Et^i^h] in the above equality gives the recursive rela- 
tionship for computing a{t,s). 

Consider now the probability sq, si). Recall that this probability refers to a group of iV — (sq + 

si) SDMs. Let us introduce some notations. Let -E'/3i|i(f,so,si) denote the event of which Pi^i{t, sq, si) 
represents the probability of occurring and let Et.hi,si,ho,so denote the event that, at time t 

• hi SDMs decides in favor of Hi, 

• ho SDMs decides in favor of Hq; 

• the remaining N — {sq + si) — {ho + hi) do not provide a decision up to time t. 

Observe that the above event is well-defined if and only if h^ + hi < N — {sq + si). Moreover Et-}i^^si,ho,so 
contributes to /5i|i(t, sq, si), i-e., Et-h^,s^,ho,so ^ ^/3i|i(t,so,si) if only '^^ hi > q - si and /iq < 
hi + si — So (the necessity of these two inequalities follows directly from the definition of /3i|i(t, so, si))- 
Considering the three inequalities + hi < N — {sq + si), hi > q — si and ho < hi + si — sq, it 
follows that 

-^/3iii(t,so,si) = U {EfM,si,ho,so \ q - Si < hi < N - {so + Si) and ho<m}, 

where m = min{/ii + si — so — 1) — (•so + si) — hi}. To conclude it suffices to observe that the events 
Et;hi.si,ho,so foT q — Si < hi < N — (so + Si) and ho < m are disjoint events and that 

nEfM,s.,K.s.] = ("^ : y^iiiit) ("^ ~l~ ^')p^ii{t) (1 - ni^iit) - .oiiit))''-'-'"-'' , 

where s = so + si. 

The probability /3i|i(t,s) can be computed reasoning similarly to Pi\i{t, sq, si). ■ 
Now we describe some properties of the above expressions in order to assess the computational 

complexity required by the formulas introduced in Proposition IIIL 1 1 in order to compute {pi\j{t; N, g)}^^, 

i,j G {0, 1}. From the expressions in Proposition |in.2| we observe that 

• a(t,so,si) is a function of 7ro|i(t) and 7ri|i(t); 

• a{t, s) is a function of a{t — l, sq, si), < sq, si < g — 1, Po|i(*)' Pi|i(*) and a{t — l, h), q < h < s; 
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. /3i|i(t,so,si), i G {0,1}, are functions of po|i(i), Vi\i{t), T^Q\i{t) and 7ri|i(t). 
Moreover from equation (fT3] ) we have that 'T^i\j{t) is a function of vrj|j(t — 1) and Pi\j{t). 

Based on the above observations, we deduce that Po\i{t',N,q) and Pi\i{t;N,q) can be seen as the 
output of a dynamical system having the {[N/2\ — q + 3)-th dimensional vector with components the 
variables vro|i(t — 1), vri|i(t — 1), a{t — I, s), q < h < [N/2\ as states and the two dimensional vector 
with components Po|i(*), Pi|i(*), as inputs. As a consequence, it follows that the iterative method we 
propose to compute {pi\j{t; N,q)}'^^, i,j G {0, 1}, requires keeping in memory a number of variables 
which grows linearly with the number of SDMs. 

B. Case lN/2\ + l<q<N 

The probabilities p^j{t; N,q), i,j G {0, 1} in the case where [N/2\ + I < q < N can be computed 
according to the expressions reported in the following Proposition. 

Proposition III.3 Consider a group of N SDMs, running the q out of N SDA algorithm for \_N/2\ +1 < 
q < N. Without loss of generality, assume Hi is the correct hypothesis. For i G {0, 1}, let : N — )• [0, 1] 
be defined as (1131 ). Then, for i G {0, 1}, we have for t = 1 

p,,i(l;iV,g) = ^ mP^i(l) {l-p.iAl)f-' (14) 

h=q ^ ^ 

and for t > 2 

p,,(t;iV,,)=£(^)4(t-l) J: ('';%^,i{t){l-n,,it)f-^'^'\ (15) 

fc=0 ^ ^ h=q-k ^ ^ 

Proof: Let t = 1. Since q > N/2, the probability that the fusion center decides in favor of Hi at 
time t = 1 is given by the probability that al least q SDMs decide in favor of Hi at time 1. From standard 
combinatoric arguments this probability is given by (fT4l) . 

If t > 1, the probability that the fusion center decides in favor of Hi at time t is given by the probability 
that h SDMs, < h < q, have decided in favor of Hi up to time t — 1, and that at least q — h SDMs 

(i) 

decide in favor of Hi at time t. Formally let El denote the event that the fusion center provides its 

(i) 

decision in favor of Hi at time t and let Ef^ ^.^, denote the event that k SDMs have decided in favor 
of Hi up to time t — 1 and h SDMs decide in favor of Hi at time t. Observe that 

^ = V} U 

fc=0 h-q-k 
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(i) 

Since -E'^^ffct-i disjoint sets it follows that 



9-1 iV-fc 
A:=0 h=q—k 



The proof is concluded by observing that 



^h,t;k,t-l 



Regarding the complexity of the expressions in (fTSl) it is easy to see that the probabilities Pi\j{t; N, q), 
i, j G {0, 1} can be computed as the output of a dynamical system having the two dimensional vector with 
components 7ro|i(i— 1), 7ri|i(i— 1) as state and the two dimensional vector with components Po|i(i),Pi|i(t) 
as input. In this case the dimension of the system describing the evolution of the desired probabilities is 
independent of A^. 

IV. Scalability analysis of the fastest and majority sequential aggregation rules 

The goal of this section is to provide some theoretical results characterizing the probabilities of being 
correct and wrong for a group implementing the q-out-of-N SDA rule. We also aim to characterize the 
probability with which such a group fails to reach a decision in addition to the time it takes for this 
group to stop running any test. In Sections [IV- Al and IIV-BI we consider the fastest and the majority rules, 
namely the thresholds q = 1 and q = respectively; we analyze how these two counting rules 

behave for increasing values of N . In Section |IV-C[ we study how these quantities vary with arbitrary 
values q and fixed values of A^. 



A. The fastest rule for varying values of N 

In this section we provide interesting characterizations of accuracy and expected time under the fastest 
rule, i.e., the counting rules with threshold q = 1. For simplicity we restrict to the case where the group 
has the almost-sure decision property. In particular we assume the following two properties. 

Assumption IV.l The number N ofSDMs is odd and the SDMs satisfy the almost-sure decision property. 

Here is the main result of this subsection. Recall that p^^^{N) is the probability of wrong decision by a 
group of N SDMs implementing the fastest rule (assuming Hi is the correct hypothesis). 
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Proposition IV.l (Accuracy and expected time under the fastest rule) Consider the q out of N SDA 

algorithm under Assumption \IV.1\ Assume q = 1, that is, adopt the fastest SDA rule. Without loss of 
generality, assume Hi is the correct hypothesis. Define the earliest possible decision time 



t := min{t G N | either Pi\i{t) ^ or Po\i{t) ^ 0}. 
Then the probability of error satisfies 

0, >Po|i©, 



(16) 



lim p^^) (AT) = } 



1, ifPi\i(t)<Po\i{t), 
^, if Pi\i{t) = Po\i{t), 



(17) 



and the expected decision time satisfies 



lim E[r|i?i, = 1] = t. 

N^oo 



(18) 



Proof: We start by observing that in the case where the fastest rule is applied, formulas in ^ 
simplifies to 

Pili{t;N,q = 1) = /3i|i(f,0,0), for all t £ N. 

Now, since Pi\i{t) = Po|i(*) = for t < i, it foUows that 

N,q = l)= 0, 0) = 0, t< i. 

Moreover we have vri|i(t) = Pi\i(t) and ■KQ\i(t) = Po\i{t). According to the definition of the probability 
/3i|i(f, 0, 0), we write 

N /^r\ ( rn. /^r -x 1 



where m = min {j — 1, A^ — j}, or equivalently 

^^^^^ /N\ f^'"VAr - • 

0,0) = E L- )pi\i(^ E / ]Po\i(*) (1 -Po|i(i)) 
j=l ^ ^ U=o ^ 



N-i-j 



N 



j=\N/2] 



N 



■N-j 



i=0 



N-j 



P^|i(i)(l-Pi|i(*)-Po|i(i)) 



N-i-j 



[N/2\ 



N 



E r Hi.® E 



TV 



. i=0 



j=\N/2] ^ 



N-J 



Po\i{t) {I - Pi\i{i) - Po\i{t)) 



N-i-j 



N-j 



(19) 
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An analogous expression for /3o|i(t, 0, 0) can be obtained by exchanging the roles of PQ\i{i) and Po\i{t) 
in equation ([T9l ). The rest of the proof is articulated as follows. First, we prove that 

lim {p,^,{t;N,q = 1) + p^iiit; N , q = 1)) = lim (/3i|i(t, 0, 0) + /3o|i(t, 0, 0)) = 1. (20) 

This fact implies that equation (fTSl) holds and that, if P\\\{i) = Po\i{t)^ then limAr_j.oo = 1/2- 

Indeed 

oo 

lim E [T\H^,N, q = l]= lim V t(po|j(t; N,q = l) N, q = 1)) = t. 

t=l 

Moreover, if Pi\i{t) = Po\i{t), then also (/3i|i(t, 0, 0) = /3o|i(t, 0, 0). 

Second, we prove that Pi|i(t) > Po\i{t) implies liniTv-^oo /3o|i(i; 0, 0) = 0. As a consequence, we have 
that limAT^oo 0, 0) = 1 or equivalently that limAr_>oo pj^ii^) = 0. 

To show equation ( |20l ). we consider the event the group is not giving the decision at time t. We aim 
to show that the probability of this event goes to zero as — oo. Indeed we have that 

P [T / t] = P [T > t] = 1 - N) + poiiit, N)) , 

and, hence, P [T > t] = implies pi\i{t, N) + Po\i{t, N) = 1. Observe that 

Lf J 

For simplicity of notation, let us denote x := Po|i(^ ^^'^ V '■— Po|i(^- distinguish two cases, (i) 
X ^ y and (ii) x = y. 

Case X / y. We show that in this case there exists e > 0, depending only on x and y, such that 



x^y^ <{x + y- ef^ , for all j > 1. (21) 



First of all observe that, since [^^)x^y^ is just one term of the Newton binomial expansion of (x + y)^"*. 



we know that {^^■)x^y^ < (x + y)^-' for all j G N. Define e(j) := x + y — ( j') \/xy and observe 
that proving equation ((2TI) is equivalent to proving limj_s.oo e(j) > 0. Indeed if limj_j.oo e{j) > 0, then 
infjgNe(j) > and thereby we can define e := infjgpj To prove the inequality \mij^ao^{j) > 0, 
let us compute limj_j.oo (^/)^^'^^''^- By applying Stirhng's formula we can write 



lim \ = lim 



/ /^.x2i\V(2j) 
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and, in turn, limj^oo = x + y — I^Jxy. Clearly, if x / y, then x + y — > 0. Defining 

e := infjgN e(j), we can write 

lim I I ( I a;-'^"' (1 — x — y)^~'^-' < lim | | (x + y — e)^-' (1 — x — y)^ 



\N-2j 



lim ( ^ ) (x + y - e)-' (1 - X - y) 



< 

j=0 

lim (1 - e)^ = 0, 

N~^oo 



which implies also limAr_j>oo P [T > t] = 0. 

Case X = y. To study this case, let y = x + ^ and let ^ — )• 0. In this case, the probability of the 
decision time exceeding t becomes 



LfJ 

fix, N,0=P[T>^ = Y^ (^^^^ x^(x + 0^' (1 - 2x - 



N-2j 



j=0 

Consider lim^-^o f{x, N,^). We have that 

LfJ .... .... LfJ 



N-2J ^ , 



where the first inequality follows from ( j') < Yl'jLo Cj) = ^^-'^ and the second inequality follows from 

I ^ I 

Ejlo {2j}i^^y^ < ^f=o = 1- So lim^^of{x,N,^) exists, and since we know that also 

limiv->oo f{x,N,^) exists, the limits are exchangeable in limN^oo^^^^-^o f{x, N,^) and 

lim lim /(x, A^, e) = lim lim /(x,iV, ^)=0. 
Af->oo5->0 ^-i-OiV-s-oo 

This concludes the proof of equation ( [20l ). 

Assume now that > Po|i(^- We distinguish between the case where Pi\i{t) > | and the case 

where po|i(i) < Pi\iit) < I- 

If > th^n Lemma [A. 1 1 implies 



1. 



and, since limAr^^oo /3i|i(t, 0, 0) > ln-aN~>ooYlf=lN/2] (l " ^ we have also that 

limAr^oo/3ili(t,0,0) = 1. 
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The case Po|i(^ < < I more involved. We will see that in this case limAr_j,oo /?o|i(*) 0) 0) = 0- 



We start by observing that, from Lemma lA.U 



iT^ E (^)pi,®([i-Pi|i(t))"-^ = o, 



and in turn 



N 



■i=0 



N - j\ ■ 



^ - Pi\i{i) - Po\i{t) 



N-j-i 



The above expression can be written as follows 

N~2 . h 
lim /3i|i(t,0,0)= lim J] ( J] 



iV\ /iV - J 



^'Sl?'©?^!!!© 1(1- (Po|i(*>i|i(i)) 



AT-/! 



lim y 



N 



where, for obtaining the second equality we used the fact (^) {'^-j) ~ {^) (J) • Similarly, 



iV-2 



lim /3oii(t,0,0) = lim V 



iV-i>oo 



We prove now that Huin^oo /3o\i(.'^,0,0) = 0. To do so we will show that there exists e depending only 
on PQ\i{t) and Pi\i{t) such that 

/, N / \ h 



E 



^'0|/(*Vi|i(*) < (PO\l{t) + Pl\lit) - € 



To do so, let 



'-{h) = Po\i{t) + piiiit) - 



Because h is bounded, one can see that e{h) > as the sum inside the root is always smaller than 

(Po|i(i)+Pi|i(i))^ Also 



lim e{h) = Po\i{t) + Pi\i{t) 



Po\i{t) + Pl\l{t) 



e;4^j+iC)poV(^Viii(^") 

Po\i{t) +Pi\iit) 



^ {P0\l{t) + Plllit))' 
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as by Lemma lA.ll 

lim ^ 1 = 0. 

'^^^ ipo\i{t)+Pi\i{t)) 

Since by assumption, Poli(^ + Pi\i(t) > 0, we have that inihen^ih) > 0. By letting e := mfh^f^e{h), 
we conclude that 

^-2 /Ar\ / \ / \ N~h 



lim^/3o|i(t,0,0) < J] (J^^ +Po\i{t) - (^1 - - Po|i(i)) 

^ E (^) (^'ml*) +Po|i(i) - (l -Pi|i(t) - Poll©) = (1 - = 0. 



h=0 

This concludes the proof. ■ 

Remark IV.2 The earliest possible decision time t defined in (fT6l) is the best performance that the fastest 
rule can achieve in terms of number of iterations required to provide the final decision. □ 

B. The majority rule for varying values of N 

We consider now the majority rule, i.e., the counting rule with threshold q = \_N/2\ + 1. We start with 
the following result about the accuracy. Recall that Pvv|i is the probability of wrong decision by a single 
SDM and that p^™] [N) is the probability of wrong decision by a group of N SDMs implementing the 
majority rule (assuming Hi is the correct hypothesis). 

Proposition IV.3 (Accuracy under the majority rule) Consider the q out of N SDA algorithm under 
Assumption \IV.1\ Assume q = [N/2\ + 1, i.e., the majority rule is adopted. Without loss of generality, 

assume Hi is the correct hypothesis. Then the probability of error satisfies 

N 

j = LJV/2j+l ^ 

According to (1221 ). the following characterization follows: 

(i) if ^ ^ Pw|i < 1/2, then p^^^{N) is a monotonic decreasing function of N that approaches 
asymptotically, that is, 

pj;(Ar)>pH(iV + 2) and ^lirn^ pJJ (A^) = 0; 

(ii) if 1/2 < p^^i < 1, then p^^^{N) is a monotonic increasing function of N that approaches 1 
asymptotically, that is, 

p5;(Ar)<pH(Ar + 2) and lirn^ p'^^liN) = 1; 
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(iii) /f p„|i = 1/2, then p^^^{N) = 1/2; 

(iv) j/pwli < 1/4, 



Proof: We start by observing that 

s=l j=[N /2\+l ^ 

Since p5^)(iV) = Er= iPo|i(s; N,q — [N/2\ + 1), taking the Umit for TV — > oo in the above expression 
leads to 

A'" / \ 

i=rfi 

Facts (i), (ii), (iii) follow directly from Lemma IATT] in Appendix lAl applied to equation (l22l) . Equation (l23l) 
is a consequence of the Taylor expansion of ((22)) : 

i=rfi ^ ^ i=rfi 



Finally, Stirling's Formula implies limAr_s.oo ([^]) = \JlN jix 2^ and, in turn, the final expansion follows 
from 2^ = 4^21/2. ■ 
We discuss now the expected time required by the collective SDA algorithm to provide a decision when 
the majority rule is adopted. Our analysis is based again on Assumption II V. 1 1 and on the assumption that 
ifi is the correct hypothesis. We distinguish four cases based on different properties that the probabilities 
of wrong and correct decision of the single SDM might have: 

(Al) the probabiUty of correct decision is greater than the probability of wrong decision, i.e., Pc|i > Pw|i; 
(A2) the probability of correct decision is equal to the probability of wrong decision, i.e., -p^y^ = p^\i = 

1/2 and there exist to and ti such that '/ro|i(to) = 1/2 and vri|i(ti) = 1/2; 
(A3) the probability of correct decision is equal to the probability of wrong decision, i.e., = 

Pvi\i = 1/2 and there exists ti such that vrx|i(ti) = 1/2, while vro|i(t) < 1/2 for all t G N 

and limt_^oovro|i(t) = 1/2; 
(A4) the probability of correct decision is equal to the probability of wrong decision, i.e., = Pwli = 

1/2, and 7ro|i(t) < 1/2, 7ri|i(t) < 1/2 for all t G N and limt_j,oo vrgji = limt_j,oo vriji(t) = 1/2. 
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Note that, since Assumption IIV. 1 1 implies p^^i +Pw|i = 1» the probability of correct decision in case 
(Al) satisfies pc\i > 1/2. Hence, in case (Al) and under Assumption IIV. 1 [ we define t^i := max{t E 
N I 7ri|i(t) < 1/2} and t>i := mm{t £ N | Tri\i{t) > 1/2}. 



Proposition IV.4 (Expected time under the majority rule) Consider the q out of N SDA algorithm 
under Assumption \IV. 1\ Assume q = [N/2\ + 1, that is, adopt the majority rule. Without loss of generality, 
assume Hi is the correct hypothesis. Define the SDM properties (A1)-(A4) and the decision times to, ti, 
t^i and t^i as above. Then the expected decision time satisfies 



lim W.\T\Hi,N,q= \N/2\ 



< 



i<i +t>i + 1 



h + to 



if the SDM has the property (Al ), 
if the SDM has the property (A2), 



2 ' 

+00, if the SDM has the property (A3) or (A4). 

Proof: We start by proving the equality for case (Al). Since, in this case we are assuming p^^i > 
the definitions of t^i and t^i implies that 7ri|i(t) = 1/2 for all t^i < t < t-^i. Observe that 

N-h 



N 



J2Pi\iit;N,q=[N/2\+l)= 



s=l 



Hence Lemma |A. 11 implies 



lim Y,pi\i{t;N,q=[N/2\+l) = < 



N^oo 



1 

2' 



if t<t 



<-' 



if t> t 



>-' 



if t<i < t < t^i , 



and, in turn, that 



lim pi|i(t;iV,g= [iV/2j +1) 



1/2, 
0, 



if t = t^i+l and t = tyi, 



otherwise. 



It follows 



lim E[T\Hi,N,q= [N/2\ + 1] = lim t (po|i(t; iV, 9 = [N/2\ + 1) + iV, g = [N/2\ + 1)) 

1 



2 V <3 ' ' >2 

This concludes the proof of the equality for case (Al). 

We consider now the case (A2). Reasoning similarly to the previous case we have that 



lim pi|i(ti;iV,g= LiV/2j +l) = l/2 
TV— s-oo 



and lim po|i(io; N, q = lN/2\ + 1) = 1/2, 
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from which it easily follows that limAr^oo E [T\Hi,N, q = [N/2\ +1] = ^ (to + ^i)- 

For case (A3), it suffices to note the following implication of Lemma |A?T] if, for a given i G {0, 1}, 
we have 7ri|i(t) < 1/2 for all t G N, then limN^oo Pi\iit; N,q = [N/2\ + 1) = for all t G N. The 
analysis of the case (A4) is analogous to that of case (A3). ■ 

Remark IV.5 The cases where p^^i > p^.\i and where there exists to such that vro|i(to) = 1/2 while 
7ri|i(t) < 1/2 for all t G N and lim.t^oo'^i\i{t) = 1/2, can be analyzed similarly to the cases (Al) 
and (A3). Moreover, the most recurrent situation in applications is the one where there exists a time 
instant t such that TTi\i{t) < 1/2 and iTi\i{t + 1) > 1/2, which is equivalent to the above case (Al) with 
t>i = t<i + 1- In this situation we trivially have limAr^^oo E [T\Hi, N,q = \N/2]] = t>i . □ 

C. Fixed N and varying q 
We start with a simple result characterizing the expected decision time. 

Proposition IV.6 Given a group of N SDMs running the q out of A'" SDA, for j G {0, 1}, 

E[T\Hj,N,q= 1] < E[T\Hj,N,q = 2] < • • • < E[T\Hj,N,q = N]. 

The above proposition states that the expected number of iterations required to provide a decision 
constitutes a nondecreasing sequence for increasing value of q. Similar monotonicity results hold true 
also for Pc\j{N, q), p^ij{N, q), Pad\j{N, q) even though restricted only to [N/2\ + I < q < N. 

Proposition IV.7 Given a group of N SDMs running the q out of N SDA, for j G {0, 1}, 

p,\^{N, q = [N/2\ + 1) > p,\j{N, q = [N/2\ + 2) > • • • > p,|,(iV, q = N), 
p^\j{N, q = [N/2\ + 1) > p,|,(Ar, q = [N/2\ + 2) > • • • > p^\^{N, q = N), 
PndyiN, q = [N/2\ + 1) < p,^y{N, q = [N/2\ + 2) < • • • < p,^y{N, q = N). 

We believe that similar monotonic results hold true also for 1 < g < [N/ 2j . In particular, here is our 
conjecture: if N is odd, the single SDM has the almost-sure decision and the single SDM is more likely 
to provide the correct decision than the wrong decision, that is, p^.\j +Pw|j = 1 ^^d p^.\j > p^\j, then 

p,\,{N,q = 1) < p,\j{N,q = 2) < • • • < p,\j{N,q= [N/2\ + 1), 

p„|,(iV, g = 1) > p^\j{N, q = 2)>---> p^\j{N, q = [N/2\ + 1). 

These chains of inequalities are numerically verified in some examples in Section Ivl 
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V. Numerical analysis 

The goal of this section is to numerically analyze the models and methods described in previous 
sections. In all the examples, we assume that the sequential binary test run by each SDMs is the classical 
sequential probability ratio test (SPRT) developed in 1943 by Abraham Wald. To fix some notation, we 
start by briefly reviewing the SPRT Let X he a random variable with distribution f{x;9) and assume 
the goal is to test the null hypothesis Hq : 9 = 9q against the alternative hypothesis Hi : 6 = 9i. For 
i G {1, • • • ,N}, the i-th SDM takes the observations Xi{l),Xi{2),x{3), . . . , which are assumed to be 
independent of each other and from the observations taken by all the other SDMs. The log-likelihood 
ratio associated to the observation Xi{t) is 

A,W=logfi§|4^ (24) 

f[xi{t),9o) 

Accordingly, let Ai{t) = X]l=i ^iih) denote the sum of the log-likelihoods up to time instant t. The i-th 
SDM continues to sample as long as 770 < Aj(t) < rji, where 770 and rji are two pre-assigned thresholds; 
instead sampling is stopped the first time this inequality is violated. If Aj(t) < r/o, then the i-th SDM 
decides for 9 = 6q. If Ki{t) > r/i, then the i-th SDM decides for = 9i. 

To guarantee the homogeneity property we assume that all the SDMs have the same thresholds ryo 
and r/i. The threshold values are related to the accuracy of the SPRT as described in the classic Wald's 
method |[T9l . We shortly review this method next. Assume that, for the single SDM, we want to set the 
thresholds r/o and rji in such a way that the probabilities of misdetection (saying Hq when Hi is correct, 
i.e., P[say Hq\Hi]) and of false alarm (saying Hi when Hq is correct, i.e., P[say Hi\Hq\) are equal to 
some pre-assigned values Pmisdetection and Pfaise alarm- Wald proved that the inequaUties P[say i^o | -f^i] < 
Pmisdetection and P[say Hi I Hq] < pfaise alarm are achieved when 770 and r?i satisfy % < log j^^^^^^ and 
^1 > log As customary, we adopt the equality sign in these inequalities for the design of rjo 

and rji. Specifically, in all our examples we assume that Pmisdetecdon = Raise alarm = 0.1 and, in turn, that 

m = -m = log 9. 

We provide numerical results for observations described by both discrete and continuous random 
variables. In case X is a discrete random variable, we assume that f{x; 9) is a binomial distribution 



f{x;9) 



□ ^-(1-^)"--, if xG{0,l,...,n}, 

(25) 



0, otherwise, 
where n is a positive integer number. In case X is a continuous random variable, we assume that /(x; 9) 
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is a Gaussian distribution with mean 9 and variance cr^ 

/(^;^) = ^^e-(^-^)'/2-\ (26) 

The key ingredient required for the applicability of Propositions IIII. 1 1 and IIII.2I is the knowledge of the 
probabilities {fo|o(*))Pi|o(0}jgpj ^rid {po\i{t) , Pi\i{t)] Given thresholds r/o and r/i, there probabilities 
can be computed according to the method described in the Appendix IBll (respectively Appendix IB2I) for 
X discrete (respectively X continuous) random variable. 

We provide three sets of numerical results. Specifically, in Example IV. II we emphasize the tradeoff 
between accuracy and expected decision time as a function of the number of SDMs. In Example IV.2I we 
concentrate on the monotonic behaviors that the q out of N SDA algorithm exhibits both when N is fixed 
and q varies and when q is fixed and varies. In Example IV. 3 1 we compare the fastest and the majority 
rule. Finally, Section IV-AI discusses drawing connections between the observations in Example IV. 3 1 and 
the cognitive psychology presentation introduced in Section II-DI 

Example V.l (Tradeoff between accuracy and expected decision time) This example emphasizes the 
tradeoff between accuracy and expected decision time as a function of the number of SDMs. We do that 
for the fastest and the majority rules. We obtain our numerical results for odd sizes of group of SDMs 
ranging from 1 to 61. In all our numerical examples, we compute the values of the thresholds r]o and 
rji according to Wald's method by posing j^misdetection = j^faise alarm = 0.1 and, therefore, r]i = log 9 and 
r/o = - log 9. 

For a binomial distribution /(x; 6) as in ( [25] ). we provide our numerical results under the following 
conditions: we set n = 5; we run our computations for three different pairs {Oq, 6i); precisely we assume 
that 9o = 0.5 - e and 0i = 0.5 + e where e G {0.02,0.05,0.08}; and Hi : 9 = 0i is always the correct 
hypothesis. For any pair (^O) ^i) we perform the following three actions in order 

(i) we compute the probabilities {po| 1(^)1 Pi according to the method described in Appendix IBli 

(ii) we compute the probabilities {po\i{t; N, q),Pi\i{t; N, q)}^^^ for g = 1 and q = [N/2\ +1 according 
to the formulas reported in Proposition IIII. 1 1 

(iii) we compute probability of wrong decision and expected time for the group of SDMs exploiting 
the formulas 

00 00 
Pv.\i{N,q) = ^Po\i{t;N,q) and E[T\Hi, N,q] =J2iPo\i{t; N,q) + pi\i{t; N,q))t. 
t=i t=i 

According to Remark ilLTl since we consider only odd numbers N of SDMs, since q < and since 

each SDM running the SPRT has the almost-sure decisions property, then Pv/\i{N, q) +Pc\i{N, q) = 1. In 
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Other words, the probabiUty of no-decision is equal to and, hence, the accuracy of the SDA algorithms 
is characterized only by the probability of wrong decision and the probability of correct decision. In our 
analysis we select to compute the probability of wrong decision. 

For a Gaussian distribution /(x; 9, a), we obtain our numerical results under the following conditions: 
the two hypothesis are Hq : 6 = and i^i : ^ = 1; we run our computations for three different values of 
cr; precisely a G {0.5, 1,2}; and i^i : ^ = 1 is always the coiTcct hypothesis. To obtain Pvj\i{N,q) and 
E[T|i7i, N, q] for a given value of a, we proceed similarly to the previous case with the only difference 
that {po|i(0)Pi|i(0}igpj ^re computed according to the procedure described in Appendix IB2I 

The results obtained for the fastest rule are depicted in Figure IV. 1[ while the results obtained for the 
majority rule are reported in Figure [3] 




Fig. 2. Behavior of the probability of wrong decision and of the expected number of iterations required to provide a decision 
as the number of SDMs increases when the fastest rule is adopted. In Figure (a) we consider the binomial distribution, in Figure 
(b) the Gaussian distribution. 

Some remarks are now in order. We start with the fastest rule. A better understanding of the plots in 
Figure [vni can be gained by specifying the values of the earliest possible decision time t defined in ( fT6l ) 
and of the probabilities Pi\i{t) and Po|i(^- our numerical analysis, for each pair (^O) ^i) considered and 
for both discrete and continuous measurements X, we had t = 1 and Pi\i{t) > Po|i(^- expected from 
Proposition |IV. 1 1 we can see that the fastest rule significantly reduces the expected number of iterations 
required to provide a decision. Indeed, as N increases, the expected decision time ¥,[T\Hi, N,q = 1] 
tends to 1. Moreover, notice that p^^]^-^{N) approaches 0; this is in accordance with equation (fTTl) . 

As far as the majority rule is concerned, the results established in Proposition IIV.3I and in Proposi- 
tion |IV4] are confirmed by the plots in Figure [3] Indeed, since for all the pairs (^O) ^i) we have considered. 
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Fig. 3. Behavior of the probability of wrong decision and of the expected number of iterations required to provide a decision 
as the number of SDMs increases when the majority rule is adopted. In Figure (a) we consider the binomial distribution, in 
Figure (b) the Gaussian distribution. 



we had p^^i < 1/2, we can see that, as expected from Proposition II V. 3 [ the probabiUty of wrong decision 
goes to exponentially fast and monotonically as a function of the size of the group of the SDMs. 
Regarding the expected time, in all the cases, the expected decision time E,[T\Hi, N,q = [N/2\ + 1] 
quickly reaches a constant value. We numerically verified that these constant values corresponded to the 
values predicted by the results reported in Proposition IIV.4I 

Example V.2 (Monotonic behavior) In this example, we analyze the performance of the general q out 
of N aggregation rule, as the number of SDMs N is varied, and as the aggregation rule itself is varied. We 
obtained our numerical results for odd values of N ranging from 1 to 35 and for values of q comprised 
between 1 and [N/2\ + 1. Again we set the thresholds r]o and r]i equal to log(— 9) and log 9, respectively. 
In this example we consider only the Gaussian distribution with cr = 1. The results obtained are depicted 
in Figure IH where the following monotonic behaviors appear evident: 

(i) for fixed N and increasing q, both the probability of correct decision and the decision time increases; 

(ii) for fixed q and increasing N, the probabiUty of correct decision increases while the decision time 
decreases. 

The fact that the decision time increases for fixed N and increasing q has been established in Proposi- 
tion IIV.6I The fact that the probability of correct decision increases for fixed N and increasing q validates 
the conjecture formulated at the end of Section IIV-CI 

Example V.3 (Fastest versus majority, at fixed group accuracy) As we noted earlier. Figures |V.1| |3] 
show that the majority rule increases remarkably the accuracy of the group, while the fastest rule decreases 
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Fig. 4. Probability of correct detection (left figure) and expected decision time (right figure) for tlie q out of N rule, plotted 
as a function of network size A'^ and accuracy threshold q. 



remarkably the expected number of iteration for the SDA to reach a decision. It is therefore reasonable 
to pose the following question: if the local accuracies of the SDMs were set so that the accuracy of the 
group is the same for both the fastest and the majority fusion rule, which of the two rules requires a 
smaller number of observations to give a decision. That is, at equal accuracy, which of the two rules is 
optimal as far as decision time is concerned. 

In order to answer this question, we use a bisection on the local SDM accuracies. We apply the 
numerical methods presented in Proposition IIII. 1 1 to find the proper local thresholds that set the accuracy 
of the group to the desired value Different local accuracies are obtained for different fusion rules 
and this evaluation needs to be repeated for each group size N . 

In these simulations, we assume the random variable X is Gaussian with variance a = 2. The two 
hypotheses we. Hq : 9 = Q and Hi : 9 = 1. The numerical results are shown in Figure [5] and discussed 
below. 

As is clear by the plots, the strategy that gives the fastest decision with the same accuracy varies with 
group size and desired accuracy. The left plot in Figure |5] illustrates that, for very high desired group 
accuracy, the majority rule is always optimal. As the accuracy requirement is relaxed, the fastest rule 
becomes optimal for small groups. Moreover, the group size at which the switch between optimal rules 
happens, varies for different accuracies. For example, the middle and right plot in Figure [5] illustrate 
that while the switch happens at = 5 for a group accuracy p^^^ = p[^J-^ = 0.05 and at A^ = 9 for 

(m) (f) n 1 
P i' = p L = 0.1. 

We summarize our observations about which rule is optimal (i.e., which rule requires the least number 
of observations) as follows: 
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Pnetwork = 0.02 Pnetwork = 0.05 Pnetwork = 0.1 




Network size Network size Network size 



Fig. 5. Expected decision time for the fastest and tlie majority rules versus group size A'^, for various network accuracy levels. 

(i) the optimal rule varies with the desired network accuracy, at fixed network size; 

(ii) the optimal rule varies with the desired network size, at fixed network accuracy; and 

(iii) the change in optimality occurs at different network sizes for different accuracies. 

A. Decision making in cognitive psychology revisited 

In this section we highlight some interesting relationships between our results in sequential decision 
aggregation (SDA) and some recent observations about mental behavior from the cognitive psychology 
literature. Starting with the literature review in Subsection II-DI our discussion here is based upon the 
following assumptions: 

(i) SDA models multi-modal integration in cognitive information processing (CIP), 

(ii) the number of SDMs correspond to the number of sensory modalities in CIP, 

(iii) the expected decision time in the SDA setup is analogous to the reaction time in CIP, and 

(iv) the decision probability in the SDA setup is analogous to the firing rate of neurons in CIP. 
Under these assumptions, we relate our SDA analysis to four recent observations reported in the CIP 
literature. In short, the fastest and majority rules appear to emulate behaviors that are similar to the ones 
manifested by the brain under various conditions. These correspondences are summarized in Table U and 
described in the following paragraphs. 

First, we look at the observation in CIP that multi-modal sites can exhibit suppressive behaviors (first 
row in Table IHl. We find that suppressive behavior is not contradictory with the nature of such a site. 
Indeed, Proposition |IV. 1 [ describes situations where an increased group size degrades the decision accuracy 
of a group using the fastest rule. 

Second, we look at the observation in CIP that, for some high-intensity stimuli, the firing rate of 
multi-modal integration sites is similar to the firing rate of uni-modal integration sites (second row in 
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Multi-sensory integration sites (cognitive psychology) 


Sequential decision aggregation (engineering sciences) 


Suppressive behavior of firing rate 


Decision probability decreases with increasing A'^ 


Sub-additive behavior of firing rates 


Probability of decision slightly increases with increasing A'^ 


Additive behavior of firing rates 


Decision probability linearly increases with increasing A'^ 


Super-additive behavior of firing rates 


Decision probability exponentially increases with increasing N 



TABLE I 

Cognitive Psychology - Engineering dictionary of correspondences. 



Table IJ). This similarity behavior appears related to behaviors observed in Figure [S] The second and third 
plots in Figure |5] illustrate how, in small groups with high individual accuracy and relatively low group 
decision accuracy, the fastest rule is optimal. Since a multi-model integration site implementing a fastest 
aggregation rule behaves similarly to a uni-modal integration site, our result give a possible optimality 
interpretation of the observed "multi-modal similar to uni-modal" behavior. 

Third, we look at the observation in CIP that activation of multi-modal integration sites is often 
accompanied with an increase in the accuracy as compared to the accuracy of a uni-sensory integration 
site (third and forth rows in Table IHl. The first plot in Figure [5] shows that when the required performance 
is a high accuracy, the majority rule is better than the fastest. Indeed Proposition IIV.3I proves that, for 
the majority rule, the accuracy monotonically increases with the group size, sometimes exponentially. 

Fourth, we look at the observation in CIP that, even under the same type of stimuli, the stimuli strength 
affects the additivity of the neuron firing, i.e., the suppressive, additive, sub-additive or super-additive 
behavior of the firing rates. Additionally, scientists have observed that depending on the intensity of the 
stimuli, various areas of the brain are activated when processing the same type of stimuli |[T3l . |[T4l . |[T2l . 
|[T5l . A possible explanation for these two observed behaviors is that the brain processes information in 
a way that maintains optimality. Indeed, our comparison in the middle and right parts of Figure [5] shows 
how the fastest rule is optimal when individual SDMs are highly accurate (strong and intact stimuli) 
and, vice versa, the majority rule is optimal when individual SDMs are relatively inaccurate (weak and 
degraded stimuli). 

We observed in the middle and right part of Figure |5] that, for high individual accuracies, the fastest 
rule is more efficient than the majority rule. We reach this conclusion by noting two observations: first, 
smaller group sizes require higher local accuracies than larger group sizes in order to maintain the same 
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group accuracy; second, the fastest rule is optimal for small groups while the majority rule is always 
optimal for larger groups. 

VI. Conclusion 

In this work, we presented a complete analysis of how a group of SDMs can collectively reach a 
decision about the correctness of a hypothesis. We presented a numerical method that made it possible 
to completely analyze and understand interesting fusion rules of the individuals decisions. The analysis 
we presented concentrated on two aggregation rules, but a similar analysis can be made to understand 
other rules of interest. An important question we were able to answer, was the one relating the size of 
the group and the overall desired accuracy to the optimal decision rules. We were able to show that, no 
single rule is optimal for all group sizes or for various desired group accuracy. We are currently extending 
this work to cases where the individual decision makers are not identical. 
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Appendix 

A. Asymptotic and monotonicity results on combinatorial sums 

Some of the results provided for the fastest rule and for the majority rule are based on the following 



Lemma A.l (Properties of half binomial expansions) For an odd number G N, and for real num- 
bers c G ffi and x G M satisfying < c < 1 and < x < c/2, define 



1945. 



properties of the binomial expansion (x + y)^ = Ylij=o {^j)x^y^ ^ 




and 




The following statements hold true: 



(i) /f < X < c/2, then, taking limits over odd values of N, 




and 




(ii) if X = c/2, then 




(iii) if c = 1 and < x < 1 /2, then 



5(A + 2;l,x) < 5(A;l,x) 



and S{N + 2; 1, x) > S{N; 1, x). 
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Proof: To prove statement (i) we start with the obvious equality = {c — x + x)^ = S_{N; c, x) + 
S{N;c,x). Therefore, it suffices to show that liniAr^oo = 0. Define the shorthand h{j) := 

(^)x-'(c — x)^~^ and observe 

h{j) j!(7vlj)!^"'(c — x)^ ^ j _|_ 1 c — X 



It is straightforward to see that > 1 <^=^ cj - xN + c - x > <;=^ J > ^ - -^^7^- 

Moreover, if j > f and < x < §, then j-^ + ^^>f-^ + ^^>f-f + ^>0. 
Here, the second inequality follows from the fact that — ^ >— YifO<x<|. In other words, 
if j > ^ and < x < |, then fj^^j^^i-^ > 1- This result implies the following chain of inequalities 
/ ( [Af/2] ) > / ( \N/2\ +!)>•••> h{N) providing the following bound on S{N; c, x) 

, o-Hc-xr^^ riv/2i(^^^,^)x™(c-x)w^j 

S{N;c,x) = ^ < ^ . 

Since (p^2]) < 2^, we can write 

2iV W21 r _ )[N/2i f2x\ ^^/^^ /2fc - xU L^^^J 

S{N-c,x)<\N/2\'—^ = \N/2\[^^^ 1'^' 

^^^2^,-2x^^2x^™^2(c-x)^L^/^J 



c / 

Let a = ^ and (3 = 2 (^^) and consider a • /? = ^^(^~^) , One can easily show that a • /3 < 1 since 
4cx — 4x^ — = — (c — 2x)^ < 0. The proof of statement (i) is completed by noting 



lim 5(7V;c,x) < lim \N/2\ (—\{a- /3)L^/2J ^ 

The proof of the statement [(u)] is straightforward. In fact it follows from the symmetry of the expressions 
when X = |, and from the obvious equality Y^^=q ~ = ■ 

Regarding statement |(iii)[ we prove here only that S{N + 2; 1, x) < S{N; 1, x) for < x < 1/2. The 
proof of S_{N + 2; 1, x) > S_{N; 1, x) is analogous. Adopting the shorthand 

^ /AT\ 

\x\l-x)^-' 



we claim that the assumption < x < 1/2 implies 

A{N,x) := f{N + 2,x)- f{N,x) <0. 
To establish this claim, it is useful to analyze the derivative of A with respect to x. We compute 

^{N,x)= J] if^y^'\i-xf-^- {N-i)(^y\l-xf-'-' + Nx''-\ (27) 

^ i=\N/2'] ^ ^ ^ i=\N/2'] ^ * ^ 
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The first sum ^ 



N-l 



--\N/2-] 



X* ^(1 — x)^ ' in the right-hand side of (|27] ) is equal to 





-N- 


(W2l) 


~2 



N-l 



,\N/2\-l 



\ — X 



,N-\N/2\ 



i=\N /2\+l 

Moreover, exploiting the identity {i + = [N - 

Af-l /^r\ N-2 



x'-Ul-x)^-\ 



E 



i=[N /2-]+l 



x'~\l-x) 



N- 



i=\N/2-] 
N-2 



N 
i + l 



x\l-x)^~'-^ 

"-^ /i\f\ 

{N-^)^^jx\l-xy 



\N-i-l 



i=[N/2 

The second sum in the right-hand side of (l27l ) can be rewritten as 



Now, many terms of the two sums cancel each other out and one can easily see that 



dl 
dx 



iN,x) 



N 
\N/2] 



\N/2]x 



(1 _ ^f-W2^ = (,J,,) \N/2] {x (1 - x))r^/2l-i ^ 



where the last equality relies upon the identity N — = [N/2\ = \N/2'\ — 1. Similarly, we have 



f N + 2 
\\N/2]+l 



(riV/21 +l)(x(l-x))™ ^ 



Hence 



{\N/2] +l)x(l-x) 



( ^ 



\N/2] 



Straightforward manipulations show that 

f N + 2 



\\N/2'] +1 



(riV/2l+l)=4^riV/2l 



\N/2]J' 



and, in turn, 

dA. 



dx 



{N,x) 





'N' 


(riv/2i) 


~2 



4^x(l-x)-l 
iV + 1 ^ ' 



--■.g{N,x) 



N + 2 , 

\— tX(\ - X) -\ 



N + \ 

where the last equality defines the function g{N,x). Observe that x > implies g{N,x) > and, 
otherwise, x = implies g{N,x) = 0. Moreover, for all A^, we have that f{N, 1/2) = 1/2 and 
f{N, 0) = and in turn that A{N, 1/2) = A(iV, 0) = 0. Additionally 



OA 

dx 



{N, 1/2) = g{N, 1/2) 



N + 2 
N + 1 



1 > 
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and 

^(iV,0) = and ^(Ar,o+) = 5(iV,0+)(0+-l) <0. 



The roots of the polynomial x i— )• Ajj^x{l—x) — 1 are ^ ^1 it y 7vq::2 j ' which means that the polynomial 
has one root inside the interval (0, 1/2) and one inside the interval (1/2, 1). Considering all these facts 
together, we conclude that the function x i— )• A(A^, x) is strictly negative in (0,1/2) and hence that 
f{N + 2,x)-f{N,x)<Q. ■ 

B. Computation of the decision probabilities for a single SDM applying the SPRT test 
In this appendix we discuss how to compute the probabilities 

{Pnd|o} U {po|o(i)>Pl|o(0}jgN {^'ndll} U {po|l (0 > (*) ligj^ ^^^^ 

for a single SDM applying the classical sequential probability ratio test (SPRT). For a short description 
of the SPRT test and for the relevant notation, we refer the reader to Section jV] We consider here 
observations drawn from both discrete and continuous distributions. 

1) Discrete distributions of the Koopman-Darmois-Pitman form: This subsection review the procedure 
proposed in [51 for a certain class of discrete distributions. Specifically, lH provides a recursive method 
to compute the exact values of the probabilities (1281 ): the method can be applied to a broad class of 
discrete distributions, precisely whenever the observations are modeled as a discrete random variable of 
the Koopman-Darmois-Pitman form. 

With the same notation as in Section |Vl let X be a discrete random variable of the Koopman-Darmois- 
Pitman form; that is 

h{x)exp{B(9)Z{x)- A{e)), if x £ Z, 

0, if X ^ Z, 

where h{x), Z{x) and A{6) are known functions and where Z is, & subset of the integer numbers Z. 
In this section we shall assume that Z{x) = x. Bernoulli, binomial, geometric, negative binomial and 
Poisson distributions are some widely used distributions of the Koopman-Darmois-Pitman form satisfying 
the condition Z{x) = x. For distributions of this form, the likelihood associated with the t-th observation 
x{t) is given by 

\{t) = {B{e,) - B{eo))xit) - {A{ei) - A{eo)). 

Let rjo,rji be the pre-assigned thresholds. Then, one can see that sampling will continue as long as 

m + t{A{9i)-A{9o)) vi + t{A{ei)-A{9o)) 



fix, 9) 



< 



V xH) < 'n^'K^y'^v-^K^o)) (29) 



B{9,)-B{9o)) ^ B{9,)-B{9o)) 
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for B{9i) - B{6o) > 0; if S(6'i) - B{9o) < the inequalities would be reversed. Observe that Yh=i ^i'^) 
is an integer number. Now let j/q*^ be the smallest integer greater than {r^o + t{A{9i) — A{6q))} / {B{6i) — 
B{9o)) and let 7?f^ be the largest integer smaller than {rji + t{A{9i) - A{9o))} /{B{9i) - B{9q)). 
Sampling will continue as long as 7?q*^ < X{t) < fff' where X{t) = X]i=i^(^)- Now suppose that, 
for any I G [?7o*^^^*^] the probability ¥[X{t) = £\ is known. Then we have 

nx{t+i) = im= f{i-j-9i)¥[x{t)=j\Hi], 



■ -(*) 

3=Vo 



and 



-(*) 
m 



■ _(t) -(t) . . , 



Po\^{t + l)=Y. E nX{t)=j\Hi\f{r-^ 



3=4:' '■=-~ 

Starting with P[^(0) = 1], it is possible to compute recursively all the quantities {pi\j{t)^^^ and 
W[X{t) = e], for any t G N, ^ G [^io^Vi^l and {Pi\j{t)}'^-^^. Moreover, if the set Z is finite, then the 
number of required computations is finite. 

2) Computation of accuracy and decision time for pre-assigned thresholds rjQ and rji: continuous 
distributions: In this section we assume that X is a continuous random variable with density function 
given by f{x,9). As in the previous subsection, given two pre-assigned thresholds ryo and r/i, the goal 
is to compute the probabilities Pi\j{t) = P[sayifi|ffj, T = t], for i,j G {1,2} and t G N. 

We start with two definitions. Let f\ 0^ and fA(t),0, denote, respectively, the density function of the 
log-likelihood function A and of the random variable A(t), under the assumption that Hi is the correct 
hypothesis. Assume that, for a given t G N, the density function fj\{t)fi. is known. Then we have 

/awa('S) = / f\eXs - x)fK{t),eM)dx, s G (?7o,?7i) , 

and 

Pi\i{t) = I / fxAi^)dz] fA{t),e.{x)dx, and pQ^i{t) = I / fx,eAz)dz j f^t),eXx)dx. 

In what follows we propose a method to compute these quantities based on a uniform discretization of 
the functions A and A. Interestingly, we will see how the classic SPRT algorithm can be conveniently 
approximated by a suitable absorbing Markov chain and how, through this approximation, the probabilities 
i,j G {1,2}, can be efficiently computed. Next we describe our discretization approach. 
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First, let 6 £ M>o, f]o = [^\5 and 7?i = \^]6. Second, for n = [^] - [^J + 1, introduce the sets 
5 = {si, . . . and F = {7.^+2, 7-n+3, 7-i, 7o, 7i: 7n-3, 7n-2} , 

where Si = f]Q + {i — 1)6, for i G {1, . . . , n}, and 7^ = i6, for i G {— n + 2, —n + 3, . . . , n — 3, n — 2}. 
Third, let A (resp. A) denote a discrete random variable (resp. a stochastic process) taking values in F 
(resp. in S). Basically A and A represent the discretization of A and A, respectively. To characterize A, 
we assume that 



[A = i6] 



S 6 

i6 < X< i6 + - 

2 - - 2 



i G {— n + 3, . . . , n — 3} 



and 

P [A 



-n 



+ 2)6] 



A < {-n + 2)6 + - 



and P [A = (n - 2)6] 



\>{n-2)6-- 



From now on, for the sake of simplicity, we shall denote P [A = 16] by pi. Moreover we adopt the 
convention that, given Si £ S and 7^ G F, we have that Si + 7j := ?7o whenever either i = 1 or 
i + j — 1 < 1, and Si + 7j := fji whenever either i = n or i + j — 1 > n. In this way Si + 7j is always 
an element of S. Next we set K{t) := X]/i=i A(/i). 

To describe the evolution of the stochastic process A, define the row vector 7r(t) = [7ri(t), . . . , 7r«(i)]-^ G 
M^^" whose i-th component 7rj(t) is the probability that A equals Si at time t, that is, 7rj(t) = P [A(t) = Si] . 
The evolution of 7r(t) is described by the absorbing Markov chain (5, A, 7r(0)) where 

• 5 is the set of states with si and s„ as absorbing states; 

m A = [ojj] is the transition matrix: denote the probability to move from state Sj to state Sj and 
satisfy, according to our previous definitions and conventions, 

Ann — li Oil 



0, for i G {2, . . . , n} and j G {1, . . . , n — 1}; 

^n-2 



- an = 

- an = Y.s=^n+2Ps and a-in = YJlZtVs-, /j G {2, . . . , n - 1}; 

- aij = pj-i ij £ {2, . . . ,n - 1}; 

• 7r(0) is the initial condition and has the property that P[A(0) = 0] = 1. 
In compact form we write 7r(t) = 7r(0)A*. 

The benefits of approximating the classic SPRT algorithm with an absorbing Markov chain (5, A, vr(0)) 
are summarized in the next Proposition. Before stating it, we provide some useful definitions. First, let 
Q G M'^""^)^'^""^' be the matrix obtained by deleting the first and the last rows and columns of A. 
Observe that / — Q is an invertible matrix and that its inverse F := {I — Q)~^ is typically known in the 
literature as the fundamental matrix of the absorbing matrix A. Second let A^2-li-i ^^d ^2"n-i denote. 
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respectively, the first and the last column of the matrix A without the first and the last component, i.e., 
^2:n-i '■= ["2,1' • • • ' an-i,i] and := [a2,n, • • • , an-i,n] ■ Finally, let e^mj+i and 1„_2 denote, 

respectively, the vector of the canonical basis of M"~^ having 1 in the ([^J + l)-th position and the 
(n — 2) -dimensional vector having all the components equal to 1 respectively. 

Proposition A.2 (SPRT as a Markov Chain) Consider the classic SPRT test. Assume that we model it 
through the absorbing Markov chain 7r(0)) described above. Then the following statements hold: 

(i) Po\j{t) = - vri(t - 1) and Pi\j{t) = 7r„(t) - 7r„(t - 1), for t G N; 

(ii) ¥[say Ho\Hj] = Cp^j^^A^ai and F[say HqIMj] = e'^™. j^„]^A^a„; and 

(iii) E[T\Hj] = eJ,_^. Fln^2. 
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